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Abstract 

We  study  the  well-known  sociological  phenomenon  of  gang  aggregation  and  territory  formation  through  an 
interacting  agent  system  defined  on  a  lattice.  We  introduce  a  two-gang  Hamiltonian  model  where  agents  have 
red  or  blue  affiliation  but  are  otherwise  indistinguishable.  In  this  model,  all  interactions  are  indirect  and  occur 
only  via  graffiti  markings,  onsite  as  well  as  on  nearest  neighbor  locations.  We  also  allow  for  gang  proliferation 
and  graffiti  suppression.  Within  the  context  of  this  model,  we  show  that  gang  clustering  and  territory  formation 
may  arise  under  specific  parameter  choices  and  that  a  phase  transition  may  occur  between  well-mixed,  possibly 
dilute  configurations  and  well  separated,  clustered  ones.  Using  methods  from  statistical  mechanics,  we  study 
the  phase  transition  between  these  two  qualitatively  different  scenarios.  In  the  mean-fields  rendition  of  this 
model,  we  identify  parameter  regimes  where  the  transition  is  first  or  second  order.  In  all  cases,  we  have  found 
that  the  transitions  are  a  consequence  solely  of  the  gang  to  graffiti  couplings,  implying  that  direct  gang  to  gang 
interactions  are  not  strictly  necessary  for  gang  territory  formation;  in  particular,  graffiti  may  be  the  sole  driving 
force  behind  gang  clustering.  We  further  discuss  possible  sociological  -  as  well  as  ecological  -  ramifications  of 
our  results. 
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1.  Introduction 

Lattice  models  have  been  extensively  used  in  the  physical  sciences  over  the  past  decades  to  describe  a  wide 
variety  of  condensed  matter  equilibrium  and  non  equilibrium  phenomena  see  e.g.,  the  reviews  in  [6,  62,  44]. 
Magnetization  was  the  original  application,  but  the  list  has  grown  to  include  structural  transitions  in  DNA 
[35,  1,  51],  polymer  coiling  [59,  24],  cellular  automata  [21,  61],  and  gene  regulation  [49,  33,  58]  to  name  a  few. 
The  resulting  models  are  certainly  simplified,  but  what  they  lack  in  detail  is  compensated  by  their  amenability 
to  analytical  and  computational  treatment  -  and,  occasionally,  to  exact  solution.  Moreover,  at  least  for  the 
behavior  in  the  vicinity  of  a  continuous  transition,  the  simplifications  inherent  in  these  approximate  models  may 
be  presumed  to  be  inconsequential.  In  short,  lattice  models  have  proved  extremely  useful  in  the  context  of  the 
physical,  biological  and  even  chemical  sciences.  In  more  recent  years,  lattice  models  have  also  been  applied  to 


Preprint  submitted  to  Elsevier 


October  28,  2011 


Report  Documentation  Page 


Form  Approved 
OMB  No.  0704-0188 


Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
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study  social  phenomena  [14,  55,  45],  such  as  racial  segregation  [52,  30],  voter  preferences  [22,  31,  42],  opinion 
formation  in  financial  markets  [56,  63,  3],  and  language  changes  in  society  [34,  50,  53],  offering  insight  into 
socioeconomic  dynamics  and  equilibria.  In  this  paper  we  consider  the  problem  of  gang  aggregation  via  graffiti  in 
what  is  -  to  the  best  of  our  knowledge  -  the  first  application  of  lattice  model  results  to  the  emergence  of  gang 
territoriality. 

Scratching  words  or  painting  images  on  visible  surfaces  is  certainly  not  a  new  phenomenon.  Wall  scribblings 
have  survived  from  ancient  times  and  have  been  used  to  reconstruct  historical  events  and  to  understand  societal 
attitudes  and  values.  Today,  graffiti  (from  the  Italian  graffiare,  to  scratch)  is  a  pervasive  characteristic  of  all 
metropolitan  areas  [5].  Several  types  of  graffiti  exist.  Some  are  political  in  nature,  expressing  activist  views 
against  the  current  establishment;  others  are  expressive  or  offensive  manifestations  on  love,  sex  or  race.  At  times, 
the  graffiti  is  a  mark  of  one’s  passage  through  a  certain  area,  with  prestige  being  attributed  to  the  most  prolific 
or  creative  tagger  or  to  one  who  is  able  to  reach  inaccessible  locations.  The  mark  can  be  anything  from  a  simple 
signature  to  a  more  elaborate  decorative  aerosol  painting  [48,  4].  All  of  these  types  of  graffiti  are  usually  scattered 
around  the  urban  landscape  and  do  not  appear  to  follow  any  predetermined  spatio-temporal  pattern  of  evolution. 
They  affect  the  quality  of  life  simply  as  random  defacement  of  property,  although  sometimes  they  are  considered 
art  [36]. 

On  the  other  hand,  gang  graffiti  represents  a  much  more  serious  threat  to  the  public,  since  it  is  usually  a  sign 
of  the  presence  of  criminal  gangs  engaged  in  illegal  or  underground  activities  such  as  drug  trafficking  or  extortion 
[37,  28].  Street  gangs  are  extremely  territorial,  and  aim  to  preserve  economic  interests  and  spheres  of  influence 
within  the  neighborhoods  they  control.  A  gang’s  “turf”  is  usually  marked  in  a  characteristic  style,  recognizable 
to  members  and  antagonists  [13,  40]  with  incursions  by  enemies  often  resulting  in  violent  acts.  The  established 
boundaries  between  different  gang  factions  are  sometimes  respected  peacefully,  but  more  often  become  contested 
locations  where  it  is  not  uncommon  for  murders  and  assaults  to  occur  [11].  It  is  here,  on  the  boundaries  between 
gang  turfs,  that  the  most  intense  graffiti  activity  is  usually  concentrated. 

Several  criminological  and  geographical  studies  have  been  presented  connecting  gang  graffiti  and  territoriality 
in  American  cities  [36,  4,  40].  In  particular,  it  is  now  considered  well-established  that  the  spatial  extent  of  a  gang’s 
area  of  influence  is  strongly  correlated  to  the  spatial  extent  of  that  particular  gang’s  graffiti  style  or  language. 
Furthermore,  it  is  known  that  the  incidence  of  gang  graffiti  may  change  in  time,  reflecting  specific  occurrences 
or  neighborhood  changes.  For  example,  rival  gangs  may  alternate  between  periods  of  truce  and  hostility,  the 
latter  being  triggered  by  arrests  or  shootings.  Similarly,  boundaries  may  shift  locations  when  the  racial  or  socio¬ 
economic  makeup  of  a  neighborhood  changes,  creating  new  tensions,  or  when  gang  members  migrate  to  new 
communities  [5].  In  all  these  cases,  periods  of  more  intense  gang  hostility  are  usually  accompanied  by  intense 
graffiti  marking  and  erasing  by  rival  factions  in  contested  or  newly  settled  boundary  zones  [40]. 

The  purpose  of  this  paper  is  to  present  a  mathematical  model  that  includes  relevant  sociological  and  geo¬ 
graphical  information  relating  gang  graffiti  to  gang  activity.  In  particular,  we  study  the  segregation  of  individuals 
into  well  defined  gang  clusters  as  driven  by  gang  graffiti,  and  the  creation  of  boundaries  between  rival  gangs.  We 
use  a  spin  system  akin  to  a  2D  lattice  Ising  model  to  formulate  our  problem  through  the  language  of  statistical 
mechanics.  In  this  context,  the  site  variables  s,;  have  two  constituents  which  represent  ‘gang’  and  ‘graffiti’  types, 
respectively,  and  phase  separation  is  assumed  to  be  the  proxy  for  gang  clustering.  For  the  purpose  of  simplicity, 
we  consider  only  two  gangs,  hereafter  referred  to  as  the  red  and  blue  gang,  whose  members  we  refer  to  as  agents. 
Lattice  sites  may  be  occupied  by  agents  of  either  color  or  be  void.  Since  gang  members  tag  their  territory,  we 
also  assign  a  graffiti  index  to  each  site  representing  the  preponderance  of  red  or  blue  markings  -  the  color  to  be 
associated  with  the  corresponding  gang-type.  In  particular,  agents  are  attracted  to  sites  with  graffiti  of  their 
same  color,  and  avoid  locations  marked  by  their  opponents.  We  deliberately  avoid  including  direct  interactions 
between  gang  members,  so  that  “ferromagnetic”  type  gang-gang  atractions  exist  only  insofar  as  they  are  medi¬ 
ated  by  the  graffiti.  On  one  hand  this  is  mathematically  interesting:  in  the  broader  context  of  physical  systems, 
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interactions  are  often  mediated  but  rarely  are  indirect  interactions  the  subject  of  mathematical  analysis.  On  the 
other  hand,  by  excluding  direct  gang  interactions,  we  can  specifically  focus  on  the  role  of  graffiti  in  gang  dynamics 
and  segregation.  Furthermore,  as  will  be  later  discussed,  under  certain  conditions,  gang-gang  couplings  may  be 
unimportant,  and  one  of  the  primary  conclusions  of  this  work  is  that  they  appear  to  be  unnecessary  to  account 
for  the  observed  phenomena  of  gang  segregation.  In  any  case,  we  informally  state  without  proof  that  all  the 
results  of  this  work  also  hold  if  explicit  agent-agent  interactions  are  included. 

We  thus  write  ,s,  =  (rg,  g% ) ,  representing  the  agents  and  graffiti  configuration  at  site  i,  respectively.  The  former 
component  r]i  is  discrete  allowing,  for  simplicity,  at  most  one  agent  on  each  site.  The  latter  gi  is  continuous  and, 
in  principle,  unbounded.  We  let  s  denote  a  spin  configuration  on  the  entire  lattice,  and  in  Section  2,  propose 
a  Hamiltonian,  to  embody  all  relevant  sociological  information.  Once  has  been  determined,  the 

probability  for  the  occurrence  of  a  spin  configuration  s  on  a  finite  connected  lattice  A  C  Z2  is  determined  by  the 
corresponding  Gibbs  distribution  F(s).  Note  that  due  to  the  choices  made  on  the  range  of  the  rg ,  (ji  values,  F(s) 
is  discrete  in  the  g  variables  and  continuous  in  the  g  ones.  It  is  given  by 

F(s)  =  -^exp(-JT(s)), 

where  Z  is  the  partition  function  for  the  finite  lattice  A  formally  provided  by  the  expression 

Z  =  ^exp(-JT(s)). 
ses 

Here,  S  denotes  the  set  of  all  possible  configurations  on  A  and  the  summation  symbol  is  understood  to  be  a 
summation  over  the  discrete  components  and  an  integration  over  the  continuous  ones.  As  usual,  we  begin  with  a 
finite  lattice  and  its  associated  boundary  conditions,  and  obtain  infinite  volume  results  by  taking  the  appropriate 
limits.  Using  techniques  from  statistical  mechanics,  we  prove  that  our  system  undergoes  a  phase  transition  as  the 
coupling  parameters  are  varied.  In  the  unconstrained  ensemble,  certain  parameter  choices  lead  to  predominance 
of  either  the  red  or  blue  gang,  indicating  that  for  configurations  where  the  red  to  blue  gang  ratio  is  fixed  at  unity, 
a  phase  separation  will  occur.  Conversely,  in  other  regions  of  parameter  space,  there  is  no  dominance  of  either 
gang  type,  indicating  that  the  two  are  well-mixed  and/or  dilute.  In  this  work  we  will  investigate  under  which 
conditions  to  expect  phase  separation  or  gang  dilution. 

Our  paper  is  organized  as  follows:  in  Section  2,  we  give  details  of  the  model  and  in  Section  3,  we  prove 
that  a  phase  transition  exists  as  a  function  of  the  relevant  parameters.  Since  information  on  the  location  of  all 
transition  points  is,  by  necessity,  incomplete  we  consider  an  approximation  in  the  form  of  a  simplified  mean  field 
version  of  our  Hamiltonian  and  derive  the  corresponding  mean  field  equations  in  Section  4.  Here,  we  show  that 
the  mean  field  Hamiltonian  also  exhibits  a  phase  transition  and  we  further  prove  that  the  latter  is  continuous  in 
one  specified  region  of  parameter  space  and  first  order  in  another.  Finally,  in  Section  5  we  end  with  a  discussion 
of  potential  sociological  and  ecological  implications  of  our  results. 

2.  The  Hamiltonian 

Let  us  define  a  spin  system  on  a  finite  lattice  A  C  Z2.  Here,  the  spin  at  each  site  i  £  A  is  denoted  by 
si  =  and,  we  reiterate,  rp  denotes  the  agent  spin  and  gi  represents  the  graffiti  field.  We  allow  the  agent 

spin  to  be  in  the  set  {0,  ±1};  iy,  =  —1  if  the  agent  at  site  i  belongs  to  the  blue  gang,  rj. j  =  0  if  there  is  no  agent, 
and  r/i  =  +1  if  the  agent  is  a  red  gang  member.  The  graffiti  field  is  in  the  set  of  real  numbers:  gi  >  0  indicates 
an  excess  of  red  graffiti,  gi  <  0  an  excess  of  blue  graffiti,  and,  in  either  case,  \gf  indicates  the  magnitude  of  the 
excess.  We  now  introduce  the  formal  Hamiltonian  Jf'(s) 
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(1) 


-Jf(s)  =  J  Y  Vi9j  +  K  Y  V i9i  +  a  Y  3i  ~  A  Y1  sh 
<i,j>  i  i  i 

where  s  is  a  given  configuration  on  the  full  A  lattice,  i  and  j  index  its  sites  and  Y^(i  j)  is  the  sum  taken  over  every 
bond  between  nearest  neighbor  sites  belonging  to  A.  We  postpone  explicit  mention  of  boundary  conditions  and 
of  the  interactions  between  spins  in  A  and  spins  on  the  lattice  boundary  Ac.  The  above  will  be  referred  to  as 
the  GI-Hamiltonian  (graffiti  interaction  Hamiltonian)  and  its  corresponding  partition  function  will  be  denoted 
by  an  unadorned  Z.  Note  that  since  rji  is  either  0  or  ±1,  rjf  =  \r]i |;  however,  we  choose  to  display  the  above 
form  to  leave  open  the  possibility  of  r]i  £Z.  As  discussed  earlier,  there  are  no  explicit  agent-agent  interactions 
in  this  model;  indeed,  the  structure  of  the  Hamiltonian  assumes  that  gang  members  interact  with  each  other 
only  via  the  graffiti  tagging.  As  a  result,  occupation  at  site  i  by  a  gang  member  is  “energetically”  favored 
only  if  nearest-neighbor  and  on-site  graffiti  are  predominantly  of  its  same  color.  The  two  coupling  constants,  J 
for  nearest-neighbor  interactions  and  K  for  onsite  occupation,  reflect  this  trend.  The  arjf  term  represents  the 
proclivity  of  a  given  site  to  be  occupied  by  agents  regardless  of  color,  implying  that  gang  members  carry  a  strong 
tendency  to  occupy  unclaimed  turf  if  a  1,  while  a  <C  —  1  represents  a  natural  paucity  of  gangs  altogether. 
Finally,  we  assume  graffiti  imbalance  of  either  color  to  be  energetically  unfavorable  via  the  —A gf  term.  This 
can  be  interpreted  as  natural  decay  of  graffiti  due  to  the  elements,  or  to  police  or  community  intervention.  For 
purposes  of  stability,  A  must  be  positive.  Although  the  interactions  J,  K  are  tacitly  assumed  to  be  positive, 
generalizations  to  negative  values  may  be  possible,  and  a  corresponding  analysis  may  be  undertaken  given  the 
proper  sociological  interpretations. 

3.  Phase  transition  in  the  GI— system 

3.1.  Low  temperature  phase 

The  basic  strategy  we  follow  to  demonstrate  an  ordered,  “low  temperature”  phase  is  a  contour  argument,  here 
illustrated:  Suppose  that  rg ,  the  agent  spin  at  site  i,  differs  from  the  agent  spin  rjj  at  a  different  site  j.  The  two 
agent  spins  can  differ  either  by  color,  representing  two  different  gang  affiliations,  or  by  occupation,  where  one 
site  is  occupied  and  the  other  is  void.  At  the  scale  of  nearest  neighbors,  each  edge  in  the  lattice  can  be  defined 
as  either  a  coherent  bond,  where  the  adjoining  lattice  sites  are  occupied  and  their  agent  spins  are  identical,  or 
as  an  incoherent  bond  if  this  condition  does  not  hold.  Thus,  explicitly,  ( T]iirJj )  =  (1>  1)  or  (—1,  —1)  are  coherent, 
and  all  the  other  types  are  not. 

Let  us  now  consider  any  path  on  the  lattice  that  joins  sites  i  and  j.  Since  i  and  j  have  agent  spins  which  are 
not  identical,  it  must  be  the  case  that  on  any  path  between  i  and  j,  there  is  an  incoherent  bond.  Furthermore, 
these  incoherent  bonds  must  form  a  closed  contour  on  the  dual  lattice  that  separates  i  from  j.  In  the  following 
subsections,  we  derive  a  bound  on  the  probability  of  any  such  incoherent  bonds  and  their  aggregation  into 
contours.  When  these  probabilities  are  small  enough  -  which  happens  in  certain  regions  of  the  parameter  space 
-  we  can  establish  a  low  temperature  phase.  For  example,  the  presence  of  a  red  agent  at  the  origin  will  imply 
that,  with  significant  probability,  the  majority  of  the  other  sites  will  also  be  occupied  by  red  agents,  showing  the 
existence  of  a  red  phase.  Similarly,  a  blue  phase  can  be  shown  to  exist. 

To  achieve  all  of  these  ends,  we  will  employ  the  methods  of  reflection  positivity  described  in  [7]  and  [54]  which 
contain  a  detailed  account  of  useful  techniques  along  with  relevant  classic  references.  In  this  paper,  we  will  be 
working  on  the  Lx  L  diagonal  2D  torus  -  the  SST  -  which  we  denote  by  Tj,.  We  will  often  refer  to  the  Gibbsian 
probability  measure  on  associated  with  the  Hamiltonian  in  Eq.(l)  which  we  denote  by  P/,(-). 
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3.1.1.  Reflection  positivity 

By  means  of  the  reflection  positivity  of  the  Gibbs  distribution  we  can  easily  bound  the  expectation  of  an 
observable  which  depends  only  on  the  spin  at  any  two  neighboring  lattice  points.  This  result  will  be  used  to 
build  the  contour  argument  that  will  lead  us  to  prove  the  existence  of  a  low  temperature  phase.  We  thus  briefly 
introduce  the  concept  of  reflection  positivity,  referring  the  interested  reader  to  [7]  for  a  more  detailed  discussion 
of  these  topics. 

Consider  a  plane  of  reflection  p  which  intersects  the  torus  in  a  path  running  through  next  nearest  (diagonal) 
pairs  of  sites.  Let  Dp  be  the  reflection  operator  through  p.  On  the  SST,  this  plane  p  divides  the  lattice  into  two 
halves,  identified  as  Tj  and  T~f,  such  that  Tj  nT£  =  p.  Let  denote  the  set  of  functions  which  depend  only 
on  the  spin  variables  in  Tj  and  similarly  for  .  The  reflection  map ,  Dp,  which,  in  a  natural  fashion  identifies 
sites  in  Tj  with  those  in  T f  via  a  reflection  through  p.  can  also  be  used  to  define  maps  between  and  : 
Specifically,  if  /  €  °Mp  ,  we  define  Dpf  €  to  be  the  function  /  evaluated  on  the  configuration  reflected  from 

TZ- 

A  measure  p  is  reflection  positive  with  respect  to  Dp  if  for  every  f,g  £  ^p+,  or  ,  the  following  two 
properties  hold 

1.  E„(/V)  >  0, 

2.  E p(fDpg)  =  Ep(gDpf). 

It  is  known  (e.g.,  see  [7])  that  is  reflection  positive  with  respect  to  Dp  for  every  p  of  the  above  described  type. 
We  next  use  reflection  positivity  to  find  an  upper  bound  on  the  expectation  of  observables  defined  on  bonds.  In 
doing  so,  we  use  the  following  lemmas: 

Lemma  3.1  Let  (i,j)  denote  a  bond  o/T^  and  let  a*  and  "fj  denote  site  events  at  the  respective  endpoints  of 
the  bond.  Let  denote  the  partition  function  (on  T/J  which  has  been  constrained  so  that  at  each  site  with 

the  parity  of  i,  the  translation  of  the  event  a.i  occurs  and  similarly  for  7.  Then,  for  L  =  2k  for  some  integer  k, 

^L{oti  n  7j)  < 


-T  L 

ZrL 


where  V  =  L2  is  the  volume  of  the  torus. 


Proof.  The  result  from  this  Lemma  dates  back  to  the  original  papers  on  the  subject.  In  particular,  the  use  of 
bond  events  on  the  SST  was  highlighted  in  [54].  A  modern  and  complete  derivation  is  contained  in  [7],  Section 
5.3.  □ 

For  a  slightly  more  general  scenario,  let  us  consider  the  bond  (i,j)  and  various  events  a\ ,  7] ,  . . . ,  and  let 

us  denote  by  bi  =  a\  fl  jj  . . .  bn  =  a ”  fl  7"  the  corresponding  bond  events  as  described.  Letting  b  =  U "=16j  we 
find 


r L(b)  <  Y, 

3= 1 


\ 

2V  n 

—  v 

1 - 

1 _ 

1 

_ l 

3= 1 

- - 1 

_ 1 

Finally,  we  have 
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Lemma  3.2  Let  r\, . . .  rm 

described  above.  Then 


denote  translations  of  the  bond  (i,j)  and  brj  the  translation  of  the  bond  event(s)  b 


In 

"  7(U)' 

1  -1 

2V 

P. L(n?=1brj)  < 

W31 

_ 1 

L 

ZfL 

Proof.  Again,  we  refer  the  reader  to  [7],  Section  5.3. 


□ 


3.1.2.  A  bound  on  the  incoherent  bond  probabilities 

In  order  to  prove  a  phase  transition  by  a  contour  argument,  we  must  place  an  upper  bound  on  the  probability 
for  the  occurrence  of  any  type  of  incoherent  bond  where  agent  spins  of  neighboring  sites  are  different.  There  are 
four  types  of  incoherent  bonds,  namely  (rji:  rjj)  =  (—1, 1),  (—1,  0),  and  (1,  0),  and  (0,  0),  regardless  of  order.  Let 
us  introduce  the  following  notation:  consider  undirected  bonds  between  two  particular  neighboring  lattice  sites, 
( i,j )  and  let  (•,  •)  denote  the  event  of  any  of  the  nine  coherent  or  incoherent  bonds  so  that 


(•>  •)  €  {(+>+)>  (— > —  )j  (+i  “)>(“>+))(+)  0),  (0,  +),(—,  0),  (0,  — ),  (0, 0)}. 

Similarly,  let  Zj.)  ^  denote  the  partition  function  restricted  to  configurations  where  all  agent  spins  are  frozen  in 
accord  with  the  above  described  (chessboard)  pattern  and  the  rest  of  the  statistical  mechanics  is  provided  by  the 
graffiti  field  against  this  background.  The  following  is  readily  obtained: 

Proposition  3.3  The  above  described  (agent- constrained)  partition  functions  are  given  by 


-rfO.O) 

^Tl 

r^i 

Va. 

V 

5 

d- -) 

jr(+i  +  ) 

eaVK  lUJ+Kf] 

'Tl 

[  Vx 

J 

d+.~) 

_  ■?(— .+) 

e<XV^  0M-^J+K\ 

'Tl 

—  'l 

[  Vx 

^-(  —  .0) 

I"  e?a 

V 

•  - 

7a 

Proof.  Since  the  agent  variables  are  frozen,  the  gz  Gaussian  variables  are  independent  and  the  above  amount  to 
straightforward  Gaussian  integrations.  □ 

Using  Lemma  3.1  and  the  fact  that  the  full  partition  function  satisfies  ZjL  >  Zj+’+\  we  can  write 


P  l(+,- 

Pl(0,0) 

-)=Pl(-,+) 

<  e-3“e-5 S.W+K}\ 

2  JK 

<  e  *  , 

(2) 

Pl(+,0)  = 

•  •  •  =  Pl(0,  — ) 

1  2  J2  +  JK+t^K2 

<  e~^ae - — 

We  denote  by  e  =  s(J,  K,  X,a)  the  sum  of  the  estimates  for  the  probabilities  provided  by  the  right  hand  sides 
of  the  preceding  display.  For  fixed  a  and  K  >  0,  note  that  that  as  J A-1/2  -»  oo  (or,  better  yet,  J A-1/2  and 
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K\~lt2  both  tending  to  infinity)  the  quantity  e  tends  to  zero.  This  implies  the  suppression  of  all  incoherent 
bonds  so  that  the  lattice  must  be  almost  fully  tiled  with  coherent  ones.  In  particular,  the  lattice  is  nearly  filled 
with  agents,  which,  at  least  locally,  are  mostly  of  the  same  type.  As  will  be  demonstrated  below,  this  implies  the 
existence  of  distinctive  red  and  blue  phases,  i.e.,  in  the  language  of  statistical  mechanics,  of  a  “low  temperature” 
regime.  We  formalize  this  result  in  the  next  subsection. 

3.1.3.  The  contour  argument 

We  have  now  established  all  the  tools  we  need  to  complete  the  contour  argument.  Accordingly,  we  now  show 
that  two  well-separated  lattice  sites  must,  with  probability  tending  to  one,  have  identical  agent  spins  in  the  limit 
eCl.  This  in  turn  will  imply  the  existence  of  a  low  temperature  phase. 

Theorem  3.1.  Consider  the  Gl-system  on  Z2  and  let  e(J,  K,  A,  a)  denote  the  quantity  described  in  the  last 
paragraph  of  the  previous  subsection.  Then,  if  the  parameters  are  such  that  e  is  sufficiently  small,  there  are 
at  least  two  distinct  limiting  Gibbs  states  characterized,  respectively,  by  the  abundance  of  red  agents  and  the 
abundance  of  blue  agents.  Moreover,  this  property  holds  in  any  limiting  shift  invariant  Gibbs  state. 

Proof.  Let  us  start  on  with  L  =  2k.  For  i,j  £  Tl  where  i  and  j  are  well  separated,  let  us  consider  the  event 
vb  '■=  {Vi  7^  Vj}  U  {rji  ^  0}.  We  will  show,  under  the  stated  conditions,  that  uniformly  in  L  this  probability 
vanishes  as  e  — >  0.  As  discussed  previously,  in  order  for  this  event  to  occur,  the  sites  i  and  j  must  be  separated 
by  a  closed  contour  consisting  of  bonds  dual  to  incoherent  bonds.  For  £  =  4, 6, . . .  let  91^  =  Tlffi  —  j,  L)  denote 
the  number  of  such  contours  of  length  £  on  T^.  Then  we  claim  that  uniformly  in  L  and  i  —  j, 

Tb  <  2£2Xe2 

where  A2  (with  A2  ~  2.638...  <  3)  is  the  connectivity  constant  for  Z2.  A  word  of  explanation  may  be  in  order. 
The  \2  generously  accounts  for  walks  of  length  £  in  the  vicinity  of  site  i  and  the  factor  of  two  for  walks  in  the 
vicinity  of  site  j.  Finally,  the  factor  of  £2  accounts  for  the  origin  of  the  walk.  Note  this  is  an  over-counting,  e.g., 
contours  which  wind  the  torus  but  do  not  necessarily  “enclose”  i  or  j  are  counted  twice.  Using  Lemma  3.2  we 
may  now  write 

e  e-.m^o 

The  above  obviously  tends  to  zero  as  £  — >  0  demonstrating  that  in  finite  volume,  the  lattice  is  either  populated 
with  mostly  red  agents  or  mostly  blue  agents  depending  -  with  high  probability  -  on  what  is  seen  at  the  origin. 
The  implication  of  this  result  is  that,  for  £  sufficiently  small,  there  are  at  least  two  infinite  volume  Gibbs  states 
-  which  can  be  realized  as  the  limits  of  the  appropriately  conditioned  TVs.  These  states  have  one  of  the  two 
mutually  exclusive  characteristics:  a  preponderance  of  red  agents  or  a  preponderance  of  blue  agents.  The  fact 
that  the  above  must  also  hold  in  any  shift  invariant  Gibbs  state  is  the  subject  of  Theorem  2.5  and  its  Corollary 
in  [8]  with  a  slight  extension  provided  by  Corollary  5.8  in  [15].  □ 

3.2.  High  temperature  phase 

As  is  sometimes  (e.g.,  historically)  the  case  in  statistical  mechanics,  it  can  be  an  intricate  job  to  establish  a 
high  temperature  phase  -  a  region  of  parameters  where  the  limiting  Gibbs  measure  is  unique  and  correlations 
decay  rapidly.  Typically,  one  calls  upon  the  Dobrushin  uniqueness  criterion  [23].  However  for  us,  this  route  is 
interdicted  by  the  unbounded  nature  of  the  gi  graffiti  field.  The  strategy  here  will  be  percolation  based:  First 
we  establish  the  so-called  FKG  property  for  all  the  associated  Gibbs  measures.  Then  what  follows  will  be  a 
relatively  standard  argument  through  which  we  show  that  the  necessary  and  sufficient  condition  for  uniqueness 
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is  that  the  average  of  rji  -  akin  to  a  magnetization  -  vanishes  in  the  state  designed  to  optimize  this  quantity. 
Then,  finally,  we  will  develop  a  random  cluster-type  expansion  demonstrating  that  under  the  expected  high 
temperature  conditions  for  the  couplings,  e.g.,  A  1,  the  stated  condition  on  this  magnetization  is  satisfied.  In 
addition,  high-temperature  behavior  should  also  be  achieved  under  the  condition  that  agents  are  sparse.  This 
requires  an  alternative  percolation  criterion  used  in  conjunction  with  the  above  mentioned  expansion.  In  both 
scenarios,  the  rapid  decay  of  correlations  arises  as  an  automatic  byproduct. 

3.2.1.  F KG  properties 

In  this  paragraph  we  will  demonstrate  that  the  FKG  Lattice  Condition  (see  e.g.,  [41]  Page  78)  is  satisfied  by 
any  finite  volume  Gibbs  measure  associated  with  the  GI  Hamiltonian.  Let  us  start  by  noting  that  we  can  define 
a  natural  partial  ordering  on  the  pair  of  states  Sj  and  s'  via  the  notation 

Si  h  4,  if  rji  >  rj'i  and  g,  >  g\. 

Further  we  introduce  the  notation  s  A  s'  to  signify  that  the  above  holds  for  all  the  s*,  s'  at  each  i  £  A.  For 
individual  spins  s,  and  s',  we  also  denote  s*  V  s'  :=  (maxjjjj, 77'},  maxj^j, gb})  and  similarly  for  the  “minimum” 
Si  A  s'.  Finally,  for  spin  configurations  s  and  s',  the  configurations  s  V  s'  and  s  A  s'  are  defined  as  the  sitewise 
maximum  and  minimum,  respectively.  The  FKG  lattice  condition  -  conveniently  stated  for  finite  volume  measures 
-  is  that  for  all  s,  s',  the  following  inequality  holds: 

FA(s  V  s')Fa(s  A  s')  >  Fa(s)Fa(s').  (3) 

The  well  known  consequence  of  the  above  is  that  any  pair  of  random  variables  that  are  both  increasing  with 
respect  to  the  partial  order  described  above  are  positively  correlated. 

Proposition  3.4  The  finite  volume  Gibbs  measures  associated  with  the  GI-Hamiltonian  satisfy  the  FKG  lattice 
condition. 

Proof.  We  consider  an  arbitrary  graph  and,  as  will  be  made  evident,  the  proof  automatically  accounts  for  any 
fixed  boundary  conditions.  Now,  as  is  well  known,  it  is  sufficient  to  establish  that  the  lattice  condition  Eq.(3) 
holds  when  differences  between  configurations  are  exhibited  only  on  a  pair  of  spin- variables.  The  fixed  boundary 
spins  thus  may  be  regarded  as  part  of  the  background  which  is  common  to  all  four  possible  agent-graffiti  spin 
configurations  in  question.  Let  us  thus  assume  that  the  differences  between  two  configurations  occur  at  sites  a 
and  b  in  the  graph  where  certain  specified  variables  have  been  “raised”  above  a  base  configuration  level  s.  We 
denote  the  single  raise  configurations  by  sa  and  sb  and  the  double  raise  by  sab.  Thus,  it  is  sufficient  to  show 
F(safc)F(s)  >  F(sa)F(s(,).  All  told,  there  are  three  possibilites  to  consider:  graffiti-graffiti,  gang-graffiti  and  gang- 
gang  raises  on  the  a  and  b  sites.  For  the  mixed  gang-graffiti  case  we  must  also  consider  the  a  =  b  possibility  where 
the  gang  and  graffiti  spins  have  been  “raised”  at  the  same  site.  We  need  not  consider  the  normalization  constant 
in  any  of  these  cases,  since  it  appears  in  identical  roles  on  both  sides  of  the  purported  inequality;  consideration 
of  the  Boltzmann  factors  is  sufficient.  Let  us  introduce,  in  the  setting  of  our  general  graph,  the  interaction 

-  Y^laiVi  +  \9i], 

(hi)  * 

where  the  first  sum  now  extends  over  all  edges  considered  to  be  part  of  the  graph  and  our  only  stipulation  is 
that  Jij  >  0.  Also,  we  may  formally  include  i  =  j  in  this  sum.  Let  us  denote  the  “raised”  graffiti  variables 
via  the  positive  increments  6ga  and  Sgb  so  that,  in  the  graffiti-graffiti  case,  at  sites  a  and  b  ga  — ►  ga  +  6ga  and 
gb  -A  gb  +  5gb.  It  is  straightforward  to  see  that  Jtf(sab)  +  Jtf(s)  =  J$?( sa)  +  Jt?( sb)  and  the  desired  inequality 


holds  as  an  identity.  Similarly  for  the  gang-gang  case.  We  can  now  consider  the  mixed  case  where,  without  loss 
of  generality,  ga  — t  ga  +  dga  and  ijb  — >  rjb  +  drjbi  and  for  us,  6rjb  =  1-  Here 


while 


However 


(tJ^?(Sa)  Af’(s))  ^  '  Ji,aVi^9a  T  Ja,bVb^9a  A(5 9a  ? 

i^b 

-(Jf(sb)  -  J?(s))  =  ^2  Jb,j9jfiVb  +  Ja,b9aSvb  +  aSgb2  ■ 
jAb 


-(J?(Bab)  -  JP(S))  = 


> 


^  J-i,a9i^ 9a,  T  Ja,b^b^ 9a  X6(Jn  ]  T  \ ^  Jb,j 9j^Vb  'A  Ja,b9a^9b  T  Jb~]  T  d{lllSfJlid7jb 

i^b  j^b 

2jT(s)  -  Jf’(Sa)  -  J^(sb). 


Combining  the  above  results  we  find  that  indeed 

J^(sab)  d-  Jf’(s)  <  s0)  +  Jt?(Sb). 

The  same  inequality  can  be  easily  shown  in  the  mixed  gang-graffiti  case  for  a  =  6,  by  assuming  ga  — >  ga  +  Sga 
and  T]a  — ^  9a  T  ^Va  and  by  following  the  same  steps  as  above.  This  completes  the  proof.  □ 


As  an  immediate  consequence,  we  can  identify  boundary  conditions  on  A  which  most  favor  the  dominance  of  the 
red  gang.  Indeed,  it  is  now  seen  -  as  was  anyway  clear  heuristically  -  that  we  must  make  the  boundary  spins  “as 
red  as  possible”  in  order  for  a  predominance  of  A  sites  to  be  occupied  by  red  agents.  This  amounts,  somewhat 
informally,  to  setting  gi  =  +oo  and  r/.j  =  1  (which  is  anyway  automatic  if  K  ^  0)  all  along  the  boundary.  This 
“specification”  which  seems  a  bit  arduous  to  work  with  is  not  nearly  as  drastic  as  it  sounds.  Let  us  start  with 
some  notation:  For  A  a  finite  subset  of  Z2,  let  us  define  <9  A  as  those  sites  in  Ac  with  a  neighbor  in  A  and  dA 
as  those  sites  in  A  with  a  neighbor  in  Ac.  Clearly  the  only  immediate  consequence  of  the  “drastic”  boundary 
condition  is  to  force  rji  =  1  for  i  £  dA  and  to  bias,  by  at  most  ( 3J  +  K)gt  the  a  priori  Gaussian  distribution  of  the 
g’s.  We  shall  do  that  -  and  a  bit  more  -  on  dA  arguing  that  this,  at  most,  is  the  result  of  the  “drastic”  boundary 
condition  on  d(A  U  dA).  Precisely,  we  define  the  red  boundary  condition  on  A  as  rji  =  1  and  g.j  independently 
distributed  as  normal  random  variables  with  variance  1/2A  and  mean  (4J  +  K)/2X  for  each  i  £  dA.  By  the 
the  established  nronotonicity  properties  these  are  exactly  the  boundary  conditions  imposed  on  the  slightly  larger 
lattice  that  will  optimize  the  average  of  rji  and  g for  any  i  £  A. 


3.2.2.  A  uniqueness  criterion 

It  is  not  hard  to  show,  by  monotonicity,  that  a  limiting  red  measure  exists  along  any  thermodynamic  sequence  of 
volumes  and  that  the  limit  is  independent  of  the  sequence  and  therefore  translation  invariant.  We  shall  denote 
this  measure  by  /-Ir(-)  and  by  Er(-)  the  corresponding  expectations.  Similarly  for  the  blue  measure  we  introduce 
/xb(-)  and  Eb(-)-  We  can  thus  state 

Proposition  3.5  The  necessary  and  sufficient  condition  for  uniqueness  among  the  limiting  Gibbs  states  for  the 
Gl-system  is  that  Er(?7o)  =  0,  where  r/o  Is  the  spin  at  the  lattice  origin. 

Pi'oof.  For  two  measures  pi  and  /i2  e.g.,  on  {—1,0, 1}““  ,  we  use  the  notation  pi  >  H2  to  indicate  that  for  any 
random  variable  X  which  is  increasing  in  all  coordinates,  the  expected  values  E1(A'),  calculated  via  the  pi 
measure  are  always  greater  than  the  those  obtained  via  g.2'- 

EipO  >  E2{X). 
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This  is  known  as  stochastic  dominance.  Consider  /.jr(-)  which,  by  slight  abuse  of  notation,  we  temporarily  take 
to  be  the  restriction  of  HR  to  agent  events.  Suppose  that  Er(t7o)  =  0.  Then,  by  translation  invariance,  we 
have  Er(tjj)  =  0  for  all  i.  Similar  considerations  apply  to  the  corresponding  hb{-)-  It  is  immediately  clear  -  by 
symmetry  or  stochastic  dominance  -  that  Pr(7/0  =  0)  =  Pb(?7o  =  0)  and  thus  the  single  site  distributions  are 
identical.  By  the  corollary  to  the  Strassen  theorem  [57,  43]  since  >  ^tB  and  these  measures  have  identical  single 
site  distributions  they  must  be  identical  probability  measures.  Similar  considerations  apply  to  the  full  measures 
since  the  distribution  of  the  gt  is  determined  by  their  conditional  distributions  given  the  local  configuration  of 
the  r]iS.  Uniqueness  is  established  since,  if  /%(•)  denotes  any  other  infinite  volume  measure  associated  to  the 
GI-Hamiltonian,  we  have  pb  >  Hq  >  /ie  which  implies  equality  in  light  of  /tr  =  hb-  □ 

3.2.3.  Proof  of  a  high-temperature  phase 

We  shall  develop  a  graphical  representation  for  the  Gl-system  akin  to  the  FI<  representation  for  the  Potts 
model  [29]  that,  for  all  intents  and  purposes,  is  the  same  as  the  one  used  in  [10],  where  only  the  case  of  bounded 
fields  is  explicitly  analyzed.  Let  us  then  consider  the  GI-Hamiltonian  in  finite  volume  with  all  notation  pertaining 
to  boundary  conditions  temporarily  suppressed.  For  fixed  s,  we  may  decompose  the  graffiti  fields  and  agents 
according  to  affiliation: 


g%  =  =  ±1,  %  =  [^1, 

Vi  =  n<?i;  o-j  =  ±i,  n  =  \vi\, 

where  the  er’s  and  id’s  have  the  definitive  character  of  Ising  variables.  We  can  now  write 

.  e  •  n. 

where  Rti]  =  qt,  rf)  :=  e2Ji’jqirj  —  1.  In  our  case,  we  have  =  ./  if  i  and  j  are  neighboring  pairs  and 

,  =  K ;  which  we  will  not  yet  distinguish  notationally  and  consider  a  general  J,;j  label.  Thus 

e-^(s)  =  e  +  1). 

(i,j) 

Opening  the  product,  we  select  one  term  for  each  “edge”:  If  the  Rij  term  is  selected,  we  declare  the  edge  to 
be  occupied ,  otherwise  it  is  vacant.  It  is  noted  here  that  the  edges  should  be  interpreted  as  directed:  all  edges 
appear  twice  and  we  must  regard  (i,j)  as  distinctive  from  (j,  i):  moreover,  for  K  0,  the  above  is  understood  to 
include  i  =  j.  The  configurations  of  occupied  edges  will,  generically,  be  denoted  by  oj.  Summing  over  the  Ising 
variables,  we  acquire  the  weights 


^)  =  2C(u)E  II  KiAJid^rj), 

q.r 

where,  as  before  the  summation  notation  also  indicates  integration  over  the  continuous  variables.  In  the  above, 
C(u>)  denotes  the  number  of  connected  components  of  u;  here  connectivity  deduced  according  to  the  directed 
nature  of  the  edges  or  via  a  double  covering  of  the  lattice.  Normalizing  these  weights  by  the  partition  function 
we  obtain  a  probability  measure  on  the  bond  configurations  u>.  As  will  be  made  explicit  below,  this  probability 
measure  on  bond  configurations  is  well  defined  in  finite  volume.  Let  us  denote  the  probability  measure  on  the 
bond  configurations  oj  by  P®(-),  where  the  ©  now  denotes  boundary  conditions  accounted  for  in  a  routine  fashion. 
Then,  for  each  oj  consisting  of  appropriate  edges,  P®(w)  €  (0, 1).  We  shall  not  discuss  the  problem  of  infinite 
volume  limits  which  would  take  us  too  far  astray  but  be  content  with  statements  that  are  uniform  in  volume. 
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With  regards  to  the  latter,  and  of  crucial  importance  for  our  purposes  is  the  connection  back  to  the  spin  -measure 
inherent  in  this  representation.  For  the  Potts  models,  this  was  first  elucidated  in  [2]  with  the  complete  picture 
emerging  in  [25].  In  particular,  for  any  site,  the  contribution  to  the  magnetization  vanishes  if  the  site  belongs 
to  a  cluster  that  is  isolated  from  the  boundary.  The  principal  objective  for  this  representation  is  the  following 
claim: 

Proposition  3.6  Let  AcZ2  be  a  finite  connected  set  and  consider  the  above  described  representation  in  A  with 
boundary  condition  0  on  dA.  Let  (a,  b)  denote  an  edge  with  a  b  and  both  a  and  b  not  belonging  to  dA.  Let  eat 
denote  the  event  that  this  edge  is  occupied  and  let  ui  denote  a  configuration  on  the  compliment  of  ( a,b ).  Then, 
for  fixed  a  and  K,  there  is  an  e(J,  A)  with  £  — >  0  as  J2/ A  — »  0  such  that  uniformly  in  A,  to  and  ©  -  as  well  as 
K  and  a, 

Vf(eab)  < 

Proof.  Let  W®(-)  denote  the  configurational  weights  with  associated  boundary  conditions  as  described  above. 
Then  it  is  seen  that 

Pa  (w  V  eab)  _  WgjwVeab) 

1-P  ®(WVeab)“  Wf  (w) 

Our  goal  is  to  estimate  the  right  hand  side  of  the  above  which  thereby  generates  the  quantity  e  featured  in  the 
statement  of  this  proposition.  Noting  the  positivity  and  product  structure  of  the  numerator  and  denominator, 
we  may  regard  the  object  on  the  right  as  the  expectation  with  respect  to  a  weighted  measure  of  the  quantity  Rab 
and  we  shall  denote  this  by  Ew(i?aih).  The  latter  will  be  estimated  via  conditional  expectation:  Let  Qqa  denote 
a  specification  of  the  g-fields  and  agent  occupation  variables  except  for  qa  and  let 

e  :=  sup  E w(Rab  I  Qqa)- 

U,  Qga 

Obviously,  e  >  supw  Ew(i?aj,).  As  for  the  complimentary  fields,  there  is  not  a  great  deal  of  dependence:  In 
particular,  all  that  is  needed  is  that  rc  =  1  for  all  c  such  that  (a,  c)  €  oj.  Concerning  the  optimizing  w,  non-local 
considerations  dictate  simply,  that  u>  be  such  that  (a,  b)  does  not  reduce  the  number  of  components.  Locally, 
as  can  be  explicitly  checked,  or  derived  from  monotonicity  principals,  the  optimal  scenario  is  when  all  bonds 
emanating  from  a  are  present  in  the  configuration.  Thus  we  have 

/  e-(AJ+K^RA(J)R{K)e~Xc?  dq  _nf  e~Xq2  sinh4(Jg)  sinh  (Kq)dq 
£  ~  f  e-(3J+K)iR3(J)R(K)e-xv2dq  ~  "  /  e- A<?2  sinh3 (Jq)  sinh (Kq)dq  ’ 

where  in  the  first  line  R(J )  :=  R{J,q ,  1).  We  claim  that  the  final  ratio  is  bounded  by  J/ A1/2  multiplied  by  a 
constant  that  may  be  proportional  to  the  ratio  K/ A1/2.  Indeed  let  us  substitute  w  =  A  1^2q  and  k  :=  K/\ x!2 . 
The  above  quantity  can  thus  be  rewritten  as 


£  = 


fe  "2  sinhKw(sinhwJ/A)4du; 
J  e_w2  sinh/s:w(sinha;J/A)3(ia; 


Our  claim  is  obvious  if  n  — >  0  but  we  may  wish  to  consider  cases  where  n  stays  bounded  away  from  zero.  In 
general,  the  integrands  are  not  dominated  by  large  u>  and  we  may  expand  the  factors  sinhwJ/A  with  the  result 


2  J  /  e  u2  sinh  kuj  ■  uAduj 
A1/2  /  e-“2  sinh  kuj  ■  co3duj 
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We  finally  claim  is  that  the  right  side  is  bounded  by  a  linear  function  of  k: 

1  f  e~u  sinh  ku  •  co4  dui  „ 

- -  4 - 2  :  i - TTV  <  B’ 

1  +  «;  J  e  w  sinh  kw  •  w’aw 

for  some  H  <  oo.  This  is  indeed  true  as  k  — >  0.  We  only  need  to  show  that  the  inequality  holds  in  the  case 
k  — >  oo.  But  here  the  factor  e-“  sinh  kuj  is,  essentially,  a  Gaussian  in  the  variable  u>  —  n  and  the  desired  result 
follows.  □ 


Theorem  3.2.  Consider  the  Gl-system  and  let  e  denote  the  quantity  described  in  Proposition  3.6.  Then  for 
£<£o,  given  by 


there  is  a  unique  limiting  Gibbs  state  featuring  rapid  decay  of  correlations. 


Proof.  Using  the  result  of  Proposition  3.6,  we  shall  compare  the  described  graphical  representation  with  inde¬ 
pendent  bond  percolation  on  Z2.  We  start  with  a  well  known  -  and  readily  derivable  -  result:  Let  Yi, . . .  Yv 
denote  an  array  of  Bernoulli  random  variables  with  collective  behavior  described  by  the  measure  p-y  and  let  Yy 
denote  a  configuration  on  the  compliment  of  Y3 .  Let  us  now  introduce 


Pi  =  maxPY(Yj  =  1  |  Yy  ) 

*  v- 


to  denote  the  maximal  conditional  probability  of  observing  {Yj  =  1}.  Finally,  let  Xl7 . .  .Xjy  denote  a  collection 
of  independent  Bernoulli  random  variables  with  parameters  pi, . . .  ,pn-  Then,  denoting  the  independent  measure 
by  /Ltx,  we  have 

Mx  >  My- 

Thus  we  may  bound  the  probabilities  of  increasing  events  in  the  graphical  representation  by  the  corresponding 
probabilities  from  independent  percolation  on  Z2  with  bond  occupation  probabilities  determined  by  the  £  from 
Proposition  3.6.  However,  we  must  note  that  the  relevant  percolation  problem  has  multiple  types  of  edges.  The 
£o  in  the  statement  of  this  proposition  bounds  the  probability  of  the  event  e„b  U  ei la  by  If  the  clusters  of 
the  featured  representation  fail  to  percolate,  then,  as  A  Y1  Z2,  the  origin  is  disconnected  from  the  boundary 
with  a  probability  tending  to  one.  As  discussed  just  prior  to  Proposition  3.6,  this  implies  £^(770)  =  0  and  by 
Proposition  3.5,  uniqueness  is  established.  Under  the  condition  e  <  e0,  exponential  decay  of  correlations  can  also 
be  established.  We  will  be  content  with  the  decay  of  the  two  point  function.  The  problem  of  general  correlations 
under  these  conditions  has  been  treated  elsewhere  [16,  17].  In  particular,  for  i,j  £  Z2,  E(rttr/j)  in  the  unique 
infinite  volume  measure  is  bounded,  in  finite  volume  approximations  by  the  probability  that  i  and  j  reside  in  the 
same  cluster.  For  £  <  £qi  this  decays  exponentially  in  | i  —  j\  uniformly  in  A  for  |A|  sufficiently  large.  □ 

We  now  turn  our  attention  to  an  alternative  criterion  for  high  temperature  behavior  which  may  also  be  of 
relevance  in  a  sociological  context:  Sparsity  of  agents.  Mathematically,  this  pertains  to  the  situation  where  a  is 
large  and  negative  (—a  1)  which  a  priori  suppresses  the  fraction  of  agent  occupied  sites.  Our  arguments  will 

initially  be  based  on  more  primitive  notions  of  percolation  and,  following  the  methods  of  [20]  (see  also  [18,  19]) 
could,  perhaps,  be  completed  along  these  lines.  However,  it  turns  out  to  be  far  simpler  to  appeal  to  the  graphical 
representation  just  employed  for  the  final  stage  of  the  argument.  We  start  with  the  relevant  notion  of  percolation 
and  connection.  In  the  context  of  site  percolation  on  Z2,  we  may  define  various  notions  of  connectivity.  Here 
we  define  o-connectivity  to  indicate  connection  between  sites  that  are  no  more  than  two  lattice  sites  away.  This 
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is  not  to  be  confused  with  ^-connectivity  which  does  not  consider  a  pair  of  sites  to  be  connected  if  they  are 
separated  by  two  units  in  the  vertical  or  horizontal  direction.  We  denote  by  p*  the  threshold  for  o-percolation  on 
Z2.  Standard  arguments  dating  to  the  beginning  of  the  subject  show  that  p*  €  (0, 1);  in  particular,  p*  is  less  than 
the  threshold  for  ordinary,  or  even  ^-connected,  percolation  and  mean-field  type  bounds  readily  demonstrate 
that  p*  >  ■ 

The  next  proposition  concerns  the  relative  abundance  of,  e.g.,  red  sites  under  the  condition  — a  1  with  the 
other  parameters  fixed. 

Proposition  3.7  Consider  the  Gl-system  with  parameters  X,  K  and  J  fixed.  Then  there  is  a  Sa  =  6a(J,K,  A) 
with  6a  — >  0  as  a  — ►  — oo  such  that  uniformly  in  volume  and  boundary  conditions,  for  any  site  i  that  is  away 
from  the  boundary 

P ®  (.Vi  =  1)  <  Sa- 

Proof.  Here  we  employ  the  preliminary  (red  y  blue)  FKG  properties  that  were  established  earlier,  in  3.2.1.  We 
start  with  a  7  >  0  (and  somewhat  “large”)  and,  for  j  €  A  not  too  near  the  boundary,  we  consider  P ®(pj  >  7). 
By  the  FKG  property,  this  probability  is  less  than  the  corresponding  conditional  one  given  that  rjj  =  1  and  that 
r]k  =  1  for  all  k  that  are  neighbors  of  j .  This  conditional  probability  is  given  by  a  definitive  expression: 


Wf(9j  >7)  < 


f  e+CJ+K)ge  \g- dg 

Jg>  7 _ 7  s: 

J  e+(4J+K)ge-\gidg  ' 


The  above  can  be  expressed  directly  via  the  error  function  but  in  any  case,  as  is  not  hard  to  show, 


A  <Ie— ^-^l2, 


as  long  as  7  >  (4J  +  K)/ 2A,  which  also  quantifies  how  large  7  must  be.  Provided  i  is  a  few  spaces  away  from 
the  boundary,  we  note  that  1  —  5<f7  is  a  valid  estimate  of  the  probability  that  both  gi  and  the  p-values  at  the 
neighbors  of  i  do  not  exceed  7.  Let  us  denote  this  (good)  non-high  field  event  by  G*.  Then  we  may  write 


P f(Vi  =  1)  =  P?(Gi)P?(»7i  =  1  |  Gi)  +  P?(G?)P®(»fc  =  1  |  Gf) 

e(4J+/C)7e-a 

<  0(57  +  1  e(4J+if)7g-a  _|_  e-(4J+iC)7e-a  ' 


(4) 


where,  in  various  stages  we  have  employed  worst  case  scenarios.  Clearly,  for  fixed  ( J,  K ,  A)  we  may  choose  7  large 
so  that  S7  is  small,  and  a  negative  and  large  in  magnitude,  so  that  5a  is  small.  □ 

Theorem  3.3.  Consider  the  Gl-System  and  suppose  that  —a  is  large  enough  so  that  5a  <  p*  as  described  just 
prior  to  the  statement  of  Proposition  3.7.  Then  there  is  a  unique  limiting  Gibbs  state  featuring  rapid  decay  of 
correlations. 


Proof.  By  the  dominance  principle  stated  at  the  beginning  of  the  proof  of  Proposition  3.2  if  Sa  <  p£,  the  red 
agents  fail  to  exhibit  o-percolation  regardless  of  boundary  conditions.  Now  consider,  in  the  context  of  the  bond 
representation,  the  event  that  the  origin  is  connected  to  OA  in  the  red  boundary  conditions,  which  represents 
the  sole  non-vanishing  contribution  to  E^(r/o  =  +1)-  The  bonds  of  any  path  connecting  the  origin  to  dA  within 
this  cluster  may  be  envisioned  as  alternating  connections  between  agents  and  fields;  the  connection  to  the  red 
boundary  ensures  that  both  types  of  entities  take  on  the  red  color.  In  particular,  all  the  agents  in  the  cluster  are 
red  so  that  these  agents  must  (at  least)  form  a  o-connected  cluster.  Hence,  in  finite  volume,  we  may  bound 
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Ea(?7o  =  +1)  <  Pa(0  dA) 

o,R 

where  {0  3A}  is  the  event  of  a  red  o— connection  between  the  origin  and  the  boundary. 

o,R 

When  the  red  agent  occupation  probabilities  are  dominated  by  independent  sites  with  parameter  5a  <  p%, 
such  probabilities  decay  exponentially.  Evidently,  in  the  limiting  state,  the  “magnetization”  vanishes  which  by 
Proposition  3.5  implies  a  unique  state.  Similarly,  exponential  decay  of  correlations  is  implied  by  exponential 
decay  of  o-connectivities.  □ 

4.  The  Mean  Field  Rendition 

In  the  previous  section,  we  showed  that  a  phase  transition  between  well-mixed  and  clustering  configurations  exists 
for  the  general  Hamiltonian  in  Eq.  (1).  However,  finding  the  exact  or  even  approximate  values  of  the  J,  K,  a,  A 
parameters  for  which  the  well-mixed  to  clustering  transition  occurs  is  in  general  a  difficult  task.  Moreover,  the 
nature  of  the  transition  is  not  elucidated  by  the  techniques  of  the  preceding  section.  On  the  basis  of  informal 
simulations  and  certain  other  considerations  it  appears  that  the  transition  may  be  discontinuous  or  second  order 
depending  on  where  the  phase  boundary  is  crossed.  This  cannot  be  proved  in  the  context  of  the  present  model. 
We  thus  introduce  a  mean-field  Hamiltonian,  where  instead  of  nearest-neighbor  interactions  we  consider  an  all— 
to-all  (interaction)  coupling  that  is  rescaled  by  the  number  of  sites.  Models  of  this  sort  are  often  referred  to  as 
complete-graph  systems.  The  mean-field  Hamiltonian  allows  us  to  define,  in  the  thermodynamic  limit,  a  simple 
mean  field  free  energy  per  particle.  This  free  energy  can  be  subjected  to  exact  mathematical  analysis  which 
provides  a  quantification  of  the  phase  transition.  In  particular,  we  have  found  that  the  phase  boundary  between 
the  diffuse  states  and  the  gang-symmetry  broken  phase  can  indeed  be  of  either  type. 

Let  us  thus  consider  a  lattice  of  N  sites  -  where  the  detailed  geometry  is  no  longer  of  relevance.  At  each  site 
i,  there  is  the  same  .s,  =  (r)i,gi)  featured  in  the  previous  section.  However  now,  the  Hamiltonian  reads 

-^T(s)  =  ^  ~  ASi  )■  (5) 

i,3  i 

It  is  observed  that  the  couplings  J  and  K  need  no  longer  be  distinguished.  Indeed,  for  large  N,  the  gipi  interaction, 
and  any  other  particular  interaction  is  not  of  pertinence.  We  now  introduce  the  relevant  collective  quantities,  n 
G,  and  b ,  obtained  via  g ,  and  g.\,  which  will  allow  for  a  more  convenient  analysis.  In  particular,  if  N+  and  N~ 
designate  the  number  of  red  and  blue  lattice  agents,  respectively,  we  define 

,  N++N~  ,  N+  -  N~ 

b-.=  - - -  and  n  := - - - , 

as  the  fraction  of  the  lattice  covered  by  agents  of  any  type  and  the  excess  ■  positive  or  negative  -  of  this  fraction 
that  is  of  the  red  type.  Moreover,  we  introduce  G  =  ^  JA  gt  to  be  the  average  graffiti  imbalance.  In  this 
context,  n  and  G  are  akin  to  magnetizations  in  a  standard  one-component  spin  model,  with  n  corresponding  to 
magnetization  in  the  agent  variables  and  G  in  the  graffiti  field.  For  occasional  use,  we  also  define  n ±  =  N± /N  <  1. 
We  remark  that  in  these  definitions  there  is  an  implicit  N  dependence  which  is  notationally  suppressed. 

4-1.  The  partition  function 

In  the  forthcoming,  we  wTill  evaluate,  asymptotically,  the  mean  field  partition  function  ZMF  defined  in  accord 
with  the  previous  section  as  the  partition  sum  Zj^F  =  ^  ■  Here,  for  reasons  which  will  soon  become 
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clear,  we  will  treat  the  graffiti  field  variables  slightly  differently.  We  define 


dn9i 


as  the  normalized  Gaussian  measure  for  the  individual  field  variables.  Letting  g  denote  the  array  of  these  random 
variables  we  may  write 


7mf 

■'N 


:=  E„ 


i  9iVj+ocJ2i  Vi 


where  Eg(-)  denotes  expectation  with  respect  to  the  free  (independent)  ensemble  of  Gaussian  random  variables 
and  denotes  the  rest  of  the  partition  sum  i.e.,  over  the  agent  configurations.  It  is  acknowledged  that  this 
differs  from  the  prior  definitions  by  a  multiplicative  factor  of  [A/7 r]w/2  which,  of  course,  is  inconsequential. 

It  is  at  this  point,  with  the  current  formulation,  that  the  advantage  of  the  all  to-all  coupling  is  manifest: 
For  any  s  (and  any  N )  the  quantity  in  the  exponent  depends  only  on  n,  b  and  G:  Zj$F  =  Es('£2veN(JnG+ab'>). 
Concerning  the  agent  configurations,  to  perform  the  summation,  we  must  multiply  the  integrand  by  the  number 
of  ways  of  arranging  N+  red  sites  and  N~  blue  sites  among  N  possible  positions.  We  denote  this  object  by 
IF/v(6,  n)  which  is  given,  explicitly,  by  the  trinomial  factor 


Wjv(6,n)  (iV+,JV-)  (|7V(6  +  n),iiV(6-n))‘ 

As  for  the  graffiti  field  configurations,  it  is  noted  that  since  G  is  proportional  to  a  sum  of  Gaussian  random 
variables,  it  is  itself  a  Gaussian.  Indeed  the  the  mean  of  NG  is  zero  and  the  variance  is  Ar[2A]_1.  Thus  the 
expectation  over  g  can  be  replaced  with  the  expectation  over  NG  leading  to 

Z^F  =  ^EiVG[W/iv(6,n)eAr[Jf’G+ab]]  oc  ^  WN(n,  b)eN^JnG+ab-XG2UG 

n,b  n,b 


with  the  constant  of  proportionality  independent  of  N.  Now,  on  the  basis  of  the  Stirling  approximation, 


WN{b,n) 


2 


(1  -bY~b 


Thus,  modulo  lower  order  terms,  we  have  Z^F  ss  J2n  b  g  e  N<FI'b’n,G^  where  <f>,  the  free  energy  function,  is  given 
by 


e-$(b,n,G)  e(JnG+ab-XG 2) 


b  +  n 


b  —  n 


(1-6) 


i-& 


-1 


(6) 


In  accordance  with  standard  asymptotic  analysis 

1 


lim  log  2“*  =  min  $(6,  ?r,  G)  :=  FMf 

N—t  00  iV  b,n,G 


where  Fmf  =  Fmf(J,  a,  A)  is  the  (actual)  limiting  free  energy  per  site.  While  various  aspects  of  the  above 
scenario  for  all  -to-all  coupling  models  have  been  long  known  and  certain  cases  explicitly  proven  [26],  there  is  a 
general  theorem  to  this  effect  that  is  sufficient  for  our  purposes,  presented  in  Section  5  of  [9].  Thus  the  efforts  of 
a  mean-field  analysis  may  be  summarized  as  follows:  we  are  to  minimize  $(&,  n,  G)  and  the  values  of  b,  n  and  G 
at  the  minima  -  as  a  function  of  the  couplings  -  will  determine  the  various  phases  of  the  system.  Even  in  this 
simplified  context,  as  will  be  seen,  the  phase  transitions  can  be  dramatic. 


15 


4-2.  The  mean-field  equations 

The  free  energy  function  is  obviously  well  behaved  except  at  the  extreme  values  of  the  variables.  In  particular, 
we  would  like  to  assume  that  0  <  b  <  1  and  —b  <  n  <  +b  where  the  strict  inequalities  imply  that  the  function  is 
smooth.  Now  a  direct  calculation  of  the  asymptotics  makes  it  clear  that  no  minimum  could  possibly  occur  near 
the  b  =  0, 1  and  b  =  ±n  boundaries.  Thus,  we  can  confine  attention  to  the  interior  of  the  above  b  and  n  intervals 
and  proceed  by  differentiation  of  &(b,n,  G)  as  defined  in  Eq.(6).  Thus  we  arrive  at  the  mean-field  equations: 


d $ 
dG 
<9$ 
~db 
<9$ 
dn 


=  Jn  —  2A  G  =  0, 


=  a  +  log(l  -  b) 


fb  +  n\ 
\b  —  n) 


=  0, 


=  0, 


(7) 

(8) 
(9) 


Free  energy  minimization  only  occurs  for  values  of  (n,  b ,  G)  that  satisfy  the  above  system.  However,  other 
stationary  points  for  $  can  -  and,  e.g.,  in  the  case  of  discontinuous  transitions  generically  will  -  occur  so  we 
must  proceed  with  some  caution.  It  is  noted  that  Eq.(7)  allows  us  to  eliminate  G  altogether.  Defining  p  = 
we  rewrite  Eqs.  (8)  and  (9)  as 


4e2“  = 


l m  = 


b2  —  n 2 


(1-6) 


2  ’ 

b  +  n\ 
b  —  n  J 


(10) 

(11) 


The  analysis  of  this  system,  along  with  the  minimization  it  is  supposed  to  imply  will  constitute  the  bulk  of  the 
remainder  of  this  work.  Foremost,  it  is  noted  that  the  presentation  in  Eqs. (10)  and  (11)  are,  for  all  intents  and 
purposes,  the  same  as  would  have  been  obtained  from  the  mean-  field  version  of  the  so-called  BEG  model  [12], 
As  such,  some  aspects  of  the  current  problem  have  been  treated  in  [27].  However,  the  specifics  in  [27]  are  not 
readily  translated  into  that  of  the  current  work  and,  moreover,  our  conclusions  are  achieved  by  straightforward 
methods  of  analysis. 

Our  investigation  will  proceed  as  follows:  It  is  evident  from  physical  considerations,  and  the  subject  of  an 
elementary  mathematical  theorem  proved  at  the  end  of  this  subsection,  that  as  the  parameters  sweep  through 
their  allowed  values,  a  phase  transition  occurs  from  the  circumstances  where  d>  is  minimized  by  n  =  0  to  those 
where  n  ^  0  is  required.  First,  we  will  follow  the  consequences  of  the  assumption  that  this  happens  continuously: 
i.e.,  that  the  minimizing  n  goes  to  zero  continuously  through  small  values.  In  the  leading  order,  this  provides 
a  purported  phase  boundary  which  we  denote  by  the  LSP-curve.  Considerations  of  higher  order  terms  in  the 
vicinity  of  the  LSP-curve  yield  that  for  certain  portions  of  the  curve,  the  stipulation  is  self-consistent  and  for  the 
rest,  it  is  not.  Detailed  analysis  will  show  that  the  former  is  completely  consistent.  In  particular  these  calculations 
correspond  to  the  true  minima  of  the  free  energy  function.  By  contrast,  the  latter  (non-self-consistent)  portion 
is  a  consequence  of  a  discontinuous  transition  which  has  “already”  occurred  at  prior  values  of  the  parameters.  In 
particular,  the  perturbative  analysis  is  highlighting  a  local  extremum  and  not  the  true  minimum. 

We  conclude  this  subsection  with  the  derivation  of  the  LSP-curve  -  as  well  as  the  introduction  of  notation 
that  will  be  used  throughout  the  reminder  of  the  analysis.  Assuming  n  =  0,  Eq.(ll)  is  trivially  satisfied  and 
Eq.(10)  defines  the  “ambient”  value  of  6  which  we  denote  by  6^: 


bR 


2ea 

1  +  2ea  ' 
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Note  that  ( b  =  bR,n  =  G  =  0)  is  always  a  solution  to  the  mean-field  system.  For  simplicity  we  consider  bn 
and  [a  as  the  relevant  parameters  for  our  system  for  the  remainder  of  this  paper.  Let  us  now  consider  slight 
perturbations  of  b  about  bR  and  of  n  about  zero.  We  thus  write  b  =  &a(1  +  A)  with  A  <C  bR  and  \n\  >  0  with 
n  <C  1  and  obtain  the  following  approximations  by  expanding  Eq.(10)  to  lowest  order 


n 


2A- 


b2R 


1  -bT 


while  Eq.(ll),  written  to  a  higher  approximation  than  will  be  immediately  necessary,  gives  us 


(12) 


n  nA 


(13) 


We  pause  to  observe  that  Eq.(13)  and,  in  general,  Eq.(ll),  have  the  symmetry  property  that  with  all  other 
quantities  fixed,  if  n  is  a  solution  then  so  is  —n.  Thus,  we  might  as  well  assume  that  n  >  0.  Indeed,  we  shall 
adhere  to  this  convention  throughout.  Assuming  now  that  /.t  is  variable  while  bR  is  fixed,  the  n  — >  0  limit  of 
Eq.(13)  and  Eq.(12)  yields  the  tentative  phase  boundary 


Hs{bR)  = 

bR 


(14) 


This  defines  the  LSP-curve;  the  correspondingly  tentative  conclusion  is  that  n  >  0  and  b  >  bR  occurs  for  /i  >  fig 
while  for  /i  <  /is,  n  =  b  —  bR  =  0.  However,  the  viability  of  these  tentative  conclusions  depends,  in  a  definitive 
fashion,  on  the  value  of  6#.  In  particular  an  analysis  of  the  higher  order  terms  in  Eq.(13)  testifies  that  this 
picture  cannot  possibly  be  correct  for  bR  <  this  is  the  subject  of  our  next  subsection.  However,  a  more 
difficult  analysis  shows  that  this  picture  is  indeed  correct  for  bR  >  |  which  is  the  subject  matter  of  the  final 
subsection.  First,  we  must  attend  to  some  necessary  details. 


4-2.1.  Preliminary  analysis 

In  this  subsection  we  will  establish  some  basic  properties  of  the  model  such  as  the  existence  of  high-  and 
low-temperature  phases  along  with  various  nronotonicity  properties.  In  particular  we  show  that  at  fixed  bR,  the 
quantity  n,  assumed  to  be  non-negative,  is  non-decreasing  with  /i  and  strictly  increasing  whenever  it  is  non¬ 
zero.  For  the  benefit  of  our  physics  readership,  such  contentions  might  typically  be  assumed  and  consequently, 
the  entire  subsection  could  be  skipped  on  a  preliminary  reading.  However,  it  is  remarked  that  in  the  normal 
(physics)  course  of  events,  such  questions  are  most  often  settled  by  direct  perturbative  calculation.  Even  for 
continuous  transitions,  on  some  occasions,  additional  justification  is  actually  required.  Sometimes,  as  in  the 
present  work,  when  the  transition  is  discontinuous,  the  relevant  calculations  simply  cannot  be  done  analytically 
and  then,  indeed,  one  must  rely  more  heavily  on  abstract  methods. 

In  what  follows,  we  shall  work  with  the  free  energy  function  given  by  Eq.(6)  with  G  eliminated  in  favor  of  n 
according  to  Eq.(7)  and  working  with  the  parameters  /r  and  bR.  For  simplicity,  this  will  be  denoted  by  <bf,HiAt(&,  n) 
but  with  subscripts  omitted  unless  absolutely  necessary.  Thus  $(&,  n)  is  now  notation  for  the  function 

4>hH,M(6,n)  :=  --/r n 2  -  a(bR)b 

+  (nr)  “•«  (Nr) +  (^tt)  log  (^) +  (1  ~b)  loe(1  - l) ■  (15) 

It  is  clear  that  the  minimum  of  $(6,  n)  corresponds  to  the  minimum  of  the  original  three  variable  free  energy 
function  $(n,  b,  G).  In  the  following,  will  use  the  notation  n(ifi)  (with  n(/i)  >  0)  as  though  this  defines  an 
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unambiguous  function.  Of  course  in  the  case  of  phase  coexistence,  this  will  not  be  true.  In  general,  then,  n(/j) 
will  stand  for  a  representative  from  the  set  of  minimizers  at  parameter  value  /. i  and  all  of  the  results  in  this 
subsection  hold.  We  start  with  some  elementary  properties  of  the  phase  diagram  generated  by  the  corresponding 
minimization  problem. 

Proposition  4.1  Consider  4>(6,  n)  with  b n  fixed  and  p  ranging  in  [0,  oo).  Then  for  all  p  sufficiently  large, 
4>(6,  n)  is  minimized  by  a  non-zero  n  and  for  all  p  sufficiently  small,  $  is  minimized  by  (bn,  0). 


Proof.  We  begin  with  the  assertion,  gleaned  from  Eq.(8),  that  along  the  curves  n  =  0,  4>  is  minimized  by 
b  =  bn-  Thus  we  may  pick  any  fixed,  nontrivial  no,  with  0  <  no  <  bn,  and  it  is  sufficient  to  establish  that 
&(no,bn)  <  4>(0 ,bn)  once  p  is  sufficiently  large.  However,  the  desired  inequality  is  manefest  for  large  /a  since 
the  only  p  dependence  in  $  is  in  the  term  —  )  pn^  which  is,  eventually,  in  excess  of  the  differences  between  the 
/i  independent  term  and  $(0,6/?).  The  second  statement  is  proved  as  follows:  since  b  =  0  -  which  necessarily 
implies  n  =  0  -  does  not  minimize  the  free  energy  function,  we  may  use  the  variable  8  :=  n/b  so  that  Eq.(10) 
now  reads 


bpd 


As  is  well  known  from  the  analysis  of  mean-  field  Ising  systems  (and  can  be  established,  e.g.,  by  further  differen¬ 
tiation)  the  above  equation  has  only  the  trivial  solution  if  bp  <  1.  Since  b  cannot  be  greater  than  one,  the  second 
statement  has  been  proved  -  in  fact  whenever  p  <  1.  □ 


The  above  result  establishes,  in  a  limited  sense,  the  existence  of  a  phase  transition.  Here  we  will  sharpen  this 
result  by  proving  that  along  the  lines  of  fixed  bn,  there  is  a  single  transition  from  n  =  0  to  n  >  0.  This  is  an 
immediate  corollary  to  the  following  lemma  which  we  state  separately  for  future  purposes. 

Lemma  4.2  Let  4>AI( b,n )  denote  the  free  energy  function  with  bn  fixed  and  /z  (displayed)  in  [0, oo).  Then  the 
minimizing  n(p),  if  unique,  is  a  non- decreasing  function  of  p.  More  generally,  if  at  various  values  of  p,  4>M  has 
a  minimizing  set  of  n ’s  then,  if  p!  >  p,  the  minimum  of  the  minimizers  at  p'  is  greater  than  or  equal  to  the 
maximum  of  the  minimizers  at  p.  Thus,  in  general  any  possible  “choice”  of  n(p)  is  non- decreasing. 

Proof.  Let  p,p!  €  [0,  oo)  with  p!  >  p  and  let  us  denote  by  ( b',n ')  a  minimizing  pair  for  4^/  and  similarly 
for  (b,  n)  at  p.  The  key  observation  is  the  meager  /z-dependence  of  the  function  4*^.  Indeed,  = 

4>M(a :,y)  —  \(p'  —  p)y2 .  We  do  this  twice: 

<S>^(b',n)  =  $„(&',  h')  -  -  8)[n']2 

>  $M(fe,n)  -  i(/z'  -  p)[n']2 

=  $m'(M)-  y')n2  -  ^(p'  -  p)[n'}2  (16) 


leading  to  ^^(b' ,  n')  >  4^(6,  n)  +  \(p’  —  p)(n2  —  [n'}2).  This  necessarily  implies  that  [n']2  >  n2  since  otherwise, 
the  previous  inequality  would  be  strict  implying  that  ( b ,  n)  would  have  been  a  “better  minimizer”  for  4*^/  than 
(b',n').  '  □ 

Using  this  result  we  may  now  show  the  following 
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Corollary  4.3  Consider  the  mean-field  model  defined  by  the  free  energy  function  given  in  Eq.(6).  Then  for 
each  fixed  b R  £  (0, 1] ,  there  is  a  transitional  value  of  p,  denoted  by  pT(bR),  such  that  n  =  0  for  p  <  pR  and  n  >  0 
for  jJL>  pT- 

Proof.  This  follows  immediately  from  Proposition  4.1  and  Lemma  4.2  above.  □ 

Also  of  interest  is  the  following: 

Corollary  4.4  Consider  the  mean-field  model  defined  by  the  free  energy  function  given  in  Eq.(6).  Let  n(p) 
denote  any  non-negative  function  corresponding  to  a  minimizing  n  at  parameter  value  p  (usually  uniquely  deter¬ 
mined).  Then  for  p  >  pR,  the  function  n(p)  is  strictly  increasing. 

Proof.  It  is  seen  that  if  nfpr)  =  0  then  the  statement  of  this  corollary  is  self-evident  at  p  =  pr-  For  the  rest  of 
this  proof,  we  may  simply  assume  that  p  is  such  that  n(p)  >  0.  Suppose  then  that  p'  >  p  and  that  n  =  n(p)  is 
part  of  the  minimizing  pair  ( b(p),n(p ))  at  parameter  value  p.  Suppose  further  that  at  p!  the  same  n  is  also  part 
of  a  minimizing  pair.  Then  we  claim  that  the  b(p)  is  not  the  partner  at  p'  since  given  n'  -  purportedly  equal  to 
n  -  then  b'  is  uniquely  determined  by  Eq.(ll).  Upon  performing  some  algebraic  manipulations  the  latter  reads 
b'  =  n' /  tanh/Zn'.  Thus,  the  equality  n  =  n'  would  lead  to 


b'  = 


n 


tanh  p'n'  tanh  p'n  tanh  pn 


=  b 


so  that  explicitly  (b,n)  cannot  be  a  minimizer  at  parameter  value  p! .  Using  the  appropriate  b'  b,  we  would 
have 

Fmf(p)  =  3v(rc,  b')  =  $M(n,&')  --(//-  p)n 2  >  $M(n,6)  --(//-  h)n2  =  3y(n,6) 


in  contradiction  with  the  fact  that  (6,  n)  is  not  minimized  for  the  parameter  value  p! . 


□ 


4-3.  A  discontinuous  Transition  for  bn  <  | 

The  dividing  point  of  bR  =  |  along  the  LSP-curve  p  =  1  /bn  is  apparent  from  the  higher  order  terms  in 
Eq.(13).  Indeed,  supposing  p  =  l/bn  +  e  we  obtain,  with  the  additional  aid  of  Eq.(12), 

m  =  wR(b«-b  +  -  (17) 

For  bR  >  .* ,  Eq.(17)  is  consistent  (and,  as  it  turns  out  correct)  but  in  the  case  of  bR  <  |  this  equation  alone 
precludes  the  possibility  of  a  continuous  transition.  Indeed  since  we  cannot  have  n2  <  0,  the  only  logical 
consequence  of  Eq.  (17)  is  n  =  0  for  p  >  ps(bn),  he.,  the  transition  occurs  later.  But  the  lower  order  term 
insisted  that  p  =  ps{bn)  was  the  only  viable  candidate  for  a  continuous  transition.  Thus:  the  transition  cannot 
be  continuous  and,  at  least  for  bR  <  |,  the  preliminary  assumption  that  n  goes  to  zero  continuously  can  no  longer 
be  sustained.  In  particular,  for  bR  <  perturbative  analysis  will  never  be  valid  because  the  relevant  quantities 
will  never  be  small. 

This  leaves  open  the  possibility  of  a  transition  at  some  pT(bR)  that  is  different  than  ps-  We  shall  show  that 
h-T  <  Ps  as  a  direct  consequence  of  the  following: 

Proposition  4.5  Consider  the  mean-field  model  defined  via  the  free  energy  function  given  by  Eq.(6).  Then,  if 
bR  <|  at  p  =  ps(bR)  the  quantity  n  is  strictly  positive. 
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Proof.  We  expand  the  free  energy  function  b)  -  with  G  eliminated  via  Eq.  7  -  about  b  =  bR,  n  =  0  and 
along  the  curve  /. i  =  ns{bR)-  The  convenient  variables  are  now  chosen  as  b  =  bR{l  +  A)  and  n  =  bum.  We  first 
note  that  all  odd  terms  in  m  must  vanish.  In  addition,  the  term  linear  in  A  vanishes  due  to  the  stationarity  of  $ 
along  the  curve  n  =  ns(bR)  and,  as  it  turns  out,  so  does  the  term  which  is  quadratic  in  in.  This  leaves  us  with 


&(b,n)  =  $(bR,  0)  +  -bR 


\m4  H - — A2  —  to2 A 

6  1  -bR 


Examining  the  quadratic  form  in  the  variables  m2  and  A,  the  condition  for  a  local  minimum  is  that 

11  1 

6  1  -bR  >  S’ 

i.e.,  bR  >  We  return  to  bR  >  |  in  the  next  subsection.  Of  current  relevance  is  the  fact  that  for  bR  <  |,  the 
curve  n  =  b^1  is  of  a  saddle  point  nature.  This  implies  that  there  is  a  direction  of  decrease  which,  as  is  easily 
seen,  is  optimized,  in  the  physical  direction,  when  m2  =  3A.  It  is  concluded  that  under  the  stated  conditions, 
we  can  produce  a  pair  ( b ,  n )  with  n2  >  0  (and  b  >  bR)  such  that  the  free  energy  for  the  non-trivial  pair  is  lower; 
we  just  make  the  corresponding  objects  small  enough  to  withstand  the  higher  order  corrections.  Thus  the  actual 
minimum  also  must  occur  for  non-trivial  values  of  the  n  observable.  □ 


We  now  have 

Theorem  4.1.  Consider  the  mean-field  system  defined  by  the  free  energy  function  as  given  in  Eq.(6).  Then  for 
bR  <  1 ,  there  is  a  discontinuous  transition  at  some  positive  nr(bR )  <  n s(bR ). 

Proof.  That  a  transition  occurs  at  some  nr  >  0  is  the  statement  of  Corollary  4.3.  Moreover,  Lemma  4.2  and 
the  above  analysis  implies  fir  <  ns-  The  discussion  prior  to  Proposition  4.5  demonstrates  that  at  /i  =  fix,  the 
quantity  n  is  already  positive.  It  only  remains  to  show  that  the  inequality  relating  ns  and  /. it  is  strict.  To 
this  end,  let  us  reimplement  the  heretofore  unnecessary  notation  for  the  full  dependence  of  the  free  energies  on 
parameters.  We  have  learned  that  for  bR  <  there  is  an  n*  >  0  and  a  6*  (with  6*  >  bR)  such  that 

Tmf (bR ,  ns)  =  ®bR,ns(b*in*)  <  ®ba,ns(bR,0)- 

Invoking  Lemma  4.2,  it  is  now  sufficient  to  show  that  there  is  a  5[i  >  0  such  that  for  some  nonzero  n,  and  some 
6,  the  inequality  ^bR^s-s^(b,  h)  <  <S>bR,ns-&n(bR)0)  can  be  shown  to  hold.  Once  again,  the  key  is  the  simple 
dependence  of  the  free  energy  functions  on  the  parameter  /i.  Indeed,  using  and  6*  as  trials,  we  obtain 

^bR,)is-5r-(n*ib*)  =  FbR,iis  T 

while  ^bR,ij.s-Sii{bR,  0)  =  (bR,  0)  <  Thus,  the  desired  inequality  will  indeed  hold  for  all  5fi 

sufficiently  small.  □ 

4-4-  T  continuous  transition  for  bR  >  | 

The  starting  point  in  our  analysis  is  to  show  that  at  the  purported  critical  curve,  the  quantity  n  actually 
vanishes. 

Proposition  4.6  For  bR  >  ^  and  n  =  bf/1  =:  ns,  the  unique  solution  to  the  mean-field  equations  is  n  =  0  with 
b  =  bR.  In  particular ,  $bRtt_ls(bR,0)  <  <hbiiiAts (6, n)  for  any  ( b,n )  ^  (b r,0). 
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Proof.  Assuming  n  >  0  the  agent  fraction  b  can  be  eliminated  in  favor  of  the  ratio 


Note  that  while  this  is  the  same  substitution  as  before,  here  it  is  b  rather  than  n  that  is  being  eliminated. 
Notwithstanding,  9  still  satisfies  0  <  9  <  1.  In  these  variables,  the  mean-field  equations,  Eq.(10)  and  Eq.(ll) 
respectively  become 

Rd 

n  = 


R  +  Vl^W 
n  =  &R.Arctanhf? 

where  in  the  above,  R  :=  6^/(1  —  bn).  Let  us  now  define  £{9)  as 


(18) 

(19) 


m  :=  71 


Re 


(1  +  R)e  (1  +  R)9 


bR  R  +  Vi^W  R+Vi^e2  r  +  q 

where  Q  =  Q{9)  :=  \/l  —  92.  To  prove  the  current  proposition  we  need  to  show  that  for  all  9  >  0, 

Arctanh#  >  £(9) 


demonstrating  that  there  cannot  be  a  non-trivial  solution  to  the  mean-field  equations  under  the  conditions 
stated.  Note  that  for  0  <  9  <C  1  the  desired  inequality  can  be  explicitly  demonstrated.  In  general,  it  is  sufficient 
to  show,  for  0  <  9  <  1,  that  f(9)  <  1/(1  —  92),  i.e. ,  in  the  Q  variable  that 

1  ^  (1  +  R){R  +  Q  +  (1  —  Q2)/Q) 

Q 2  >  {R  +  Q)2  ' 

Although  both  sides  diverge  as  Q  — >  0  the  divergence  on  the  left  hand  side  is  clearly  stronger  so  we  actually 
only  need  consider  Q  >  0  limiting  us  to  Q  £  (0, 1).  After  some  manipulation,  the  inequality  we  need  to  prove  is 
equivalent  to 

{R  +  Q)2  >  (1  +  R){RQ~  +  Q)  =  (1  +  R){RQ~  +  Q2)  +  (1  +  R){Q  —  Q 2). 

That  is,  we  now  wish  to  show 


R{R  +  2Q  +  RQ){  1  -  Q)  >  (1  +  R)Q{  1  -  Q). 

Since  Q  /  1,  the  above  is  equivalent  to 

R2  +QR  +  R2Q  >  Q. 

Finally,  since  also  Q  <  1  it  is  enough  to  show  that  2 R2Q2  +  RQ  >  Q  i.e.,  that  2 R2  +  R  >  1  which  occurs  for 
R  >  \.  This  corresponds  to  bR  >  □ 

We  can  finally  show 

Theorem  4.2.  Consider  the  mean-field  Gl-system  defined  by  the  free  energy  function  given  in  Eq.(6).  Then, 
for  bR  >  |,  as  a  function  of  /i  with  bR  fixed,  there  is  a  continuous  transition  at  f.i  =  fis  =  1  /bR.  I.e.,  n(/j)  =  0  for 
fi  <  ns  and  n{/j,)  >  0  for  the  n-component  of  any  minimizing  pair  (n(/i),  b{/+))  while,  if  /z  f  fxs,  it  is  found  that 
n(/i)  4,  0. 
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Proof.  We  will  marshal  the  facts  at  our  disposal  and  then  proceed  in  a  more  abstract  vein  than  has  been  the 
case  in  the  more  recent  of  our  arguments.  In  what  is  to  follow,  n(/r)  and  the  corresponding  is,  once  again, 
notation  for  a  minimizing  pair  without  any  claims  to  uniqueness.  By  the  preceding  proposition,  we  know  that  at 
H  =  ns,  the  quantity  n(/z)  is  unambiguous  and  vanishes  for  /r  <  /j,g  by  Lemma  4.2.  Conversely,  for  fj,  >  /dg  we 
may  write,  adhering  to  the  notation  in  the  proof  of  Theorem  4.1,  our  usual  expression: 

$bRAb’n)  = 

For  n2  a  b  —  bg  <C  1  from  Proposition  4.5,  we  know  that  the  quantity  $bR,gs  (&>  n)  agrees  witla  <1 hR,gs  (bR,  0)  up  to 
quartic  order  in  n.  Thus  allowing  n2  <C  1  with  n2(/j,  —  /ds)  n4,  ( b  —  bn)2  we  find  a  non-zero  n  corresponding  to 
a  free  energy  lower  than  that  of  $bR,gs  (bR,  0)-  Therefore,  again  by  Lemma  4.2,  we  have  n(/i)  >  0  for  all  jj  >  US- 
It  remains  to  establish  that  n  f  0  as  /i  4-  ns-  Note,  that  along  any  decreasing  sequence  of  /Vs  the  corresponding 
possible  n’s  must  be  monotone  by  Corollary  4.4  -  or  even  Lemma  4.2  -  and  hence  n  f  0  as  [d  4-  /ug.  Now  let  us 
suppose  otherwise:  that  for  some  sequence  of  /Vs  decreasing  to  /dg  there  is  an  associated  sequence  of  minimizers, 
(b(n),n(n))  that  has  n(/x)  f  n*  >  0.  Let  6*  denote  the  associated  limit  for  the  b(fd)  along  a  further  subsequence 
if  necessary.  Since 

$bRAHn),n{n))  <  $6ji,M(6fi, 0)  =  «F&H)MS(6fi,0) 

we  would  have,  by  continuity,  &bR,/j,s  ( b *,  n*)  <  ^bR,gs  R ,  0)  indicating  that  at  /r  =  fig,  there  is  a  minimizer  with 
positive  magnetization  in  contradiction  with  Proposition  4.6  above.  It  follows  that,  under  the  stated  condition 
bg  >  y ,  the  limit  of  n(jd')  is  zero  as  /. i  — ►  fig  while  it  vanishes  below  and  is  positive  above.  By  this  (and  any  other) 
criterion,  the  transition  at  /. ig  is  continuous.  This  completes  the  proof.  □ 

5.  Discussion 

In  this  work,  we  have  formulated  a  lattice  model  for  gang  territoriality  where  red  and  blue  gang  agents  interact 
solely  through  graffiti  markings.  Using  a  contour  argument,  we  showed  that  a  phase  transition  occurs  between 
a  well  mixed,  “high-temperature”  phase  and  an  ordered,  “low-temperature”  one  as  the  coupling  parameter  J 
between  gang  members  and  graffiti  becomes  stronger  while  the  graffiti  evaporation  parameter  A  decreases.  In  the 
mean  field  limit  of  all  -to-all  lattice  site  couplings,  we  can  also  identify  the  tricritical  point  in  phase  space  that 
distinguishes  the  occurrence  of  a  continuous  phase  transition  from  a  first  order  one.  We  find  this  point  to  be 
located  at  bg  =  1/3  which  corresponds,  in  terms  of  the  original  variables  of  the  problem,  to  the  gang  proclivity 
term  a  =  — 21og2.  In  particular,  for  bg  >  1/3  the  phase  transition  is  continuous  and  occurs  at  p.  =  1/bg.  Thus, 
in  the  mean-field  limit,  for  fixed  a  >  —2 log 2  the  ordered  “low  temperature”  phase  arises  for  J 2  >  \/{e~a  +  2), 
and  the  “high  temperature”  one  is  attained  on  the  other  side  of  this  inequality.  The  transition  between  the 
two  occurs  in  a  continuous  manner  across  the  J2  =  A/(e-a  +  2)  locus.  In  the  opposite  case  of  bg  <  1/3  (or 
a  <  —2  log  2)  the  phase  transition  is  discontinuous.  Here,  we  also  are  able  to  prove  that  the  transition  between 
high  and  low  temperature  phases  occurs  not  at  /i  =  1/bg,  but  rather  along  the  n  =  Hr  <  l/&i?  curve,  so  that  the 
phase  change  occurs  earlier  in  J  and  along  a  separatrix  J2  =  J2  <  A/(e_“  +  2). 

In  the  context  of  gang-graffiti  interactions,  we  may  identify  the  low  temperature,  clustered  phase  as  pertaining 
to  a  high  level  of  antagonism  between  between  rival  gangs,  where  segregation  leads  to  conflict  along  boundaries. 
Vice  versa,  the  high  temperature,  well  mixed  configuration  can  be  interpreted  as  a  peaceful  state,  where  despite 
different  affiliations,  gang  members  share  the  same  turf.  Our  mean  field  results  indicate  that  the  confrontational 
state  is  surely  attained,  whether  in  a  continuous  or  first  order  manner,  for  J 2  >  A/(e_a  +  2),  which  represents  high 
gang-graffiti  territoriality  J,  low  external  intervention  in  graffiti  removal  A  and  high  proclivity  a  for  individuals 
to  become  gang  members.  Gang  clustering  can  be  avoided  by  intervening  in  all  three  directions:  by  externally 
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eliminating  graffiti  (A),  but  also,  from  a  deeper  sociological  point  of  view,  by  decreasing  the  lure  of  graffiti  tags 
or  of  joining  gangs  in  the  first  place  (J,  a).  The  emergence  of  a  (continuous  or  discontinuous)  phase  transition 
shows  that  it  is  possible  to  obtain  segregation  in  a  lattice  model  without  invoking  direct  agent -to-agent  coupling; 
it  is  certain  that  adding  such  coupling  terms  to  the  Hamiltonian  would  allow  for  even  more  favorable  segregation 
conditions. 

Although  our  work  was  conceived  within  the  context  of  gang  interactions,  the  proposed  model  Hamiltonian 
and  the  tools  used  are  general  enough  that  our  fundamental  results  may  be  applicable  to  several  other  contexts 
where  territoriality  is  played  out  through  markings  and  not  through  direct  contact  between  players.  Many 
animals,  among  which  wolves,  foxes  and  coyotes,  are  known  to  scent-mark  their  territories  as  a  way  of  warning 
intruders  of  their  presence  and  to  exchange  internal  communication  [38].  At  times,  buffer  zones  can  originate 
between  distinct  animal  clusters  where  prey  species,  such  as  deer  or  moose,  may  thrive  [60].  Insects,  such  as 
beetles  and  bees,  are  also  known  to  avoid  previously  marked  locations  as  a  way  to  optimize  foraging  patterns. 
Similarly  to  the  role  of  gang  graffiti  markings,  foreign  scents  lead  “others”  to  retreat  from  already  occupied  turf 
or  visited  patches.  Our  work  also  applies  to  these  contexts.  Although  some  stochastic  treatments  have  been 
recently  presented  [32],  classical  ecological  studies  of  territoriality  are  usually  carried  out  via  reaction-diffusion 
equations  where  focal  points  such  as  dens,  burrows  or  nests  are  often  included  [39,  47],  leading  to  segregation. 
Within  this  work  on  the  other  hand  -  whether  first  order  or  continuous  -  agent  clustering  is  a  natural  consequence 
of  a  probabilistic  treatment  without  the  need  to  include  any  anchoring  sites.  Finally,  we  are  able  to  connect  local 
microscopic  parameters  -  J,  K,  A,  a  -  to  the  emergence  of  large  scale  territorial  patterns,  be  they  gang  clusters 
or  animal  groupings. 
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